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This paper is concerned with the probability, P[T,r(r)], that a stationary 
Gaussian process with mean zero and covariance function r(r) be nonnega- 
tive throughout a given interval of duration T. Several strict upper and lower 
bounds for P are given, along with some comparison theorems that relate 
P's for different covariance functions. Similar results are given for 
F[T,r(r)], the probability distribution for the interval between two successive 
zeros of the process. 

INTRODUCTION 

Let X(t) be a real continuous parameter Gaussian process, stationary 
and continuous in the mean. We shall assume throughout that 
EX(t) = and shall write r(r) = EX(l)X(t + r). We further assume 
throughout that we are dealing with a separable, measurable version of 
the process. 

Our main concern in this paper is the probability P[T,r(r)] that X(t) 
be nonnegative for ^ t ^ T. This quantity is of interest as a means of 
describing the duration of the excursions taken by the process from its 
mean. From P[T,r(r)], the distribution function F[\,r(r)] of the inter- 
val between successive zeros of the process can be determined by differ- 
entiation [see (19)]. This latter quantity is of importance in a variety of 
engineering applications of noise theory. 

Considerable effort has been directed in the past toward the numerical 
determination of F[\,r(T)] both theoretically 1 " 25 and empirically. 26 " 32 
These researches have resulted in various approximations for F[\,r(T)\, 
but many of these are neither upper nor lower bounds for F, and exact 
circumstances under which they are good approximations are not clear. 
Generally speaking, they are good for small values of X and become nuga- 
tory for sufficiently large A. There appears to be nothing rigorous in the 
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literature concerning the asymptotic behavior of F for large X. (An ap- 
proximation method is given in Ref. 21.) 

In this paper we summarize some known results and present a number 
of new strict bounds and comparison theorems for P[T,r(r)] and 
F[\,r(T)\. The most important of these are: Theorem 1, Section 1.3; 
Theorem 3, Section 1.4; and Theorem 10, Section 1.8. Theorems 12 and 
13 (Section 2.7) dealing with class 2 covariances (denned in Section 1.1), 
though of less importance for our goal, are perhaps of more than passing 
interest. These and other results presented shed some light on theoretical 
questions regarding P and F. Their utility in numerically determining 
these quantities will be discussed elsewhere. 

The paper is divided into two parts: Part I presents definitions, results, 
and discussions; Part II contains the more technical aspects of proofs 
and other supportive material for Part I. 

PART I — DEFINITIONS, RESULTS AND DISCUSSIONS 

l.i Preliminaries 

From its definition, it is clear that P[7>(t)] is a nonincreasing function 
of T. It assumes the value \ for T = 0. It obeys the scaling laws 

P[T,\r(r)\ = P[T,r(r)} (D 

P[T,r(*r)} = P[\T,r(r)} (2) 

X > 0. 

In asserting (2) for all X > we have assumed r(r) given for all r. 
This is a convention that will be adhered to throughout this paper. It is 
to be noted, however, that P[T,r(r)] for g T ^ T„ depends only on the 
"piece" of the covariance function r(r), ^ r ^ T • 

The scaling law ( 1 ) suggests normalizing the covariances to be con- 
sidered so that 

r(0) = 1. (3) 

We adopt this convention hereafter. 

The scaling law (2) suggests that a normalization of the time scale is 
in order. There does not appear to be a convenient way to do this for the 
class of all covariances. For processes continuous in the mean, such as 
are., being considered here, all one can say in general about covariances is 
that they are even continuous nonnegative-definite functions. This is a 
rather large class of functions containing a great variety of pathologies 
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such as nowhere differentiable continuous functions. In what follows we 
shall have occasion to consider co variances r(r), strictly monotone in 
some right neighborhood of t = and such that r(r) — 1 behaves like a 
nonnegative power of [ r | for sufficiently small | r | . We normalize and 
define this class as follows: The continuous covariance r(r) is said to be 
of class a if, as r approaches zero, 



rW = 1 -F(^rTJ + °^H. 



and if r(r) is strictly monotone in some right neighborhood < t < t„ 
of the origin. Here necessarily ^ a ^ 2 and T(a + 1 ) is the usual 
gamma function. The normalization is contained in the specific choice 
of the coefficient of | r \ a . 

To the author's knowledge, when the scaling laws (1) and (2) are 
taken into account, there are'only three distinct covariances for which 
P[T,r(r)\ is known explicitly. These are: 

(i) ri ( T ) = e -W 



< T < 30 



2 



P[2>i(t)] = - arcsin e 



Hi) r 2 (0,T) = 1 - s + 13- cos ( - 



^ T < oo; 

^ t < oo, 0^/3^1, 



P[T,r. 2 (0,r)] = 



(Hi) ?' 3 (r) = 



^-^-2^r a,TSin [ /3sin O] 



0*J*2*, 



Mi - A 

l - It I. 



0, 



|r|2S 1 
Ir I £ 1. 



1 



1 



/ J [7> 3 (r)] = - + — [arcsin (1 - 7') - y/T(2 - T)), Ogf^l. 

The process with covariance ri(r) is Markovian, and it is this special 
property that permits determination of P[7>i(t)] in this case (see Ref. 
22 or Ref. 21, Section IX). 

Case (it) corresponds to the stochastic process 

X(t) = A + Bco*[L + #1, 
with A, B and $ independent random variables, the two former being 
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normal with mean zero and variances 1 - /3 2 and 0" respectively, and the 
latter being distributed uniformly in (0,2t). The determination of P 
in this case is an exercise in integration and elementary probability the- 
ory that will be omitted here. For the obvious generalization of this case, 
namely, 

N 

X{t) = A + E #.- cos[t/Pi + *,], 
i 

P[T,r(r)] can be expressed in principle as a (2JV + 1 Mold integral. Ex- 
cept in the case N = 1 presented, the integrals appear untractable. 

The form for P[T,r 3 {T)] given follows from results found in Ref. 23. 
Note that it is valid only for T ^ 1. We have been unable to extend P 
beyond this point. 

These examples shed little light on the many questions that naturally 
arise concerning the behavior of P[T,r(r)], both as a function of T and 
as a functional of r(r). What are possible asymptotic behaviors of P 
for large T? What features of r(r) determine this behavior? To what 
extent is P determined by the behavior of r(r) in the neighborhood of 
t = 0? (For example, if r(r) is analytic in the neighborhood of t = 0, 
then it can be extended as a covariance in only one way, namely, by its 
analytic continuation. In this case, then, P[T,r(r)] is completely deter- 
mined by the behavior of r( t) near r = 0. ) If q( t) is another covariance, 
in some sense close to r(r) for ^ r g T, is P[T,r(r)] close in some 
sense to P[T,q(r)\f How can P[T,r(r)} be determined numerically for a 
given covariance r(r)? 

These and many other basic questions await to be answered in full. 

1.2 P[T,r(r)] as a Limit 

LetO = h < t 2 < ■■ ■ < t„ = T be a partition of the interval (0,T) 
into n — 1 parts. The n random variables X{ti), X(t 2 ), • • ■ , X(t n ) are 
jointly Gaussian with covariance matrix r = (r.y), where r,-> = r(U — tj). 
Denote by P„(t) the probability that these n random variables be non- 
negative. Because of the assumed separability of the process, 

P[T,r(r)] = limP„(r), (4) 

n-"» 

where it is understood that the limit is taken as the partition is refined 
with mesh tending to zero. If r(r) is positive definite, then | r | ^ for 
any choice of partition, and one can write explicitly 

P„(r) = (27r)- n/2 | r P* f <&, • • • f dM"**"" 1 '". (5) 

•'O •'0 
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It is somewhat surprising that information about P[T,r(r)\ is so difficult 
to obtain when it can be expressed as the limit of the apparently not too 
unwieldy expression on the right of (5). This integral is deceptive. For 
n > 3 it cannot be expressed in terms of elementary functions of the co- 
variance elements r,-y . Series expansions and upper and lower bounds can 
be easily written for this integral, but most of the obvious ones yield 
vacuous results in the limit as the partition is refined. 

The integral (5) admits of a simple geometric interpretation obtained 
by reducing the quadratic form in the exponent to a sum of squares by 
a linear transformation and performing a radial integration. P n (r) is the 
fraction of the unit sphere in Euclidean n-space cut out by n-hyperplanes 
through the center of the sphere. The angle 0,-y between the normals to 
the ith. and jth hyperplanes directed into the cutout region is given by 
cos 8ij = r ijt i,j = 1,2, • • • , n. This geometric interpretation of P„(r) 
holds even when | r | =0. For n = 2 and 3, this picture gives at once 

P2 = s~ W — 0i 2 ] = -. + — arcsin r n (6) 

p 3 = ~ [2tt - e 12 - 13 - 23 ] 

4r 

i i (7) 

= 5+7- [arcsin r J2 + arcsin r i3 + arcsin r M ]. 

Seen on the surface of the sphere, the region described above is the 
generalization of the spherical triangle in three-space and is known as an 
n-dimensional spherical simplex. Geometers have studied the problem 
of expressing the content of the spherical simplex in terms of the angles 
between its bounding surfaces. 33 " 38 Many of their results can be readily 
derived from known results in probability theory using the connection 
with P„(r) just mentioned (see Section 2.1). 

It is clear that P„(r) is an upper bound for P[T,r(r)]. The result (7) 
then is a simple upper bound for P[7',r(r)], where r 12 = r(t 2 — h), 
ri3 = r(t 3 — U), r 23 = r(/ 3 — to) and h , t 2 , t 3 are any three points in the 
interval (0,T). For very small values of T, this upper bound can be made 
close to the true value of P[T,r(r)]. For large values of T, this is gen- 
erally not the case. If, for example, r(r) is never negative, P 3 is always 
greater than J. If r(r) oscillates in sign, there is a minimum value for P 3 
different from zero (unless r(r) achieves the value —1) obtainable for 
any choice of h ^ k ^ h, and hence this bound for P[T,r( T )] does not 
approach zero for large T. 



408 THE BELL SYSTEM TECHNICAL JOURNAL, MARCH 1962 

1.3 A Comparison Theorem for P[T,r(r)] 

Recall that in the geometric picture of P„(r), r {j = cos 9 it where d {j is 
the angle between the inward normals to hyperplanes i and j. Intuitively, 
it is clear that if this angle is decreased, i.e., if r u is increased, P„(r) 
should also increase. This is borne out by the following 

Lemma 1\ — Let P„(r) be the probability that n jointly Gaussian vari- 
ates with mean zero and normalized covariance matrix r(r« = 1) be non- 
negative. Let q be another normalized n X n covariance matrix. If Uj ^ g,-y 
for i,j = 1, 2, • • • , n, then P„(r) ^ P„(q). 

Note that the matrices r and q need only be nonnegative definite (as 
distinguished from positive definite). 

By regarding P[T,r(r)] as a limit of P„(r), as explained in the pre- 
ceding section, Lemma 1 can be used to deduce the following comparison 
theorem. 

Theorem 1 — If r{r) ^ q{r) for ^ r £ To , then P[T,r(r)] ^ 
P[T,q(T)}forO ^ T ^ T„. 

The covariance function of a process is generally regarded as a rough 
measure of how much the process "hangs together." This view is sup- 
ported by the theorem. A process with a larger covariance function 
hangs together more and is more likely to maintain the same sign than 
one with a smaller covariance. 

The comparison theorem can be used with the three covariances 
(Section 1.1) for which P[T,r(r)] is known exactly to bound this quan- 
tity for other covariances. The theorem is particularly useful for com- 
paring covariances of the same class. Let r(r) and q(r) both be of class 
a, and suppose that r(r) ^ q{r) in some neighborhood of the origin. 
Then P[T,r(r)] ^ P[T,q(r)] in this neighborhood. But, for any X > 
\,q(r) ^ Kat) in some sufficiently small neighborhood of the origin, 
so that also P[7',g(r)] ^ P[T,r(\ T )} = P[X7\r(r)] by the scaling law 
(2). Choosing X appropriately leads to the following 

Theorem 2 — Let r(r) and q(r) be of class a with r(r) ^ q{r) in some 
neighborhood of r = 0. Then for some T* > 0, 

P[T,r(r)} ^ P[T,q(r)] ^ P[r-\q(T)),r(r)\, ^ T ^ T*. 

The theorem is proved in Section 2.3 where the determination of T* 

and the choice of proper branch for r~\q) are also discussed. Knowledge 

of P[T,r(r)] thus provides both upper and lower bounds for P[T,q(r)\ 

near t = 0. 

t Proved in Section 2.2. A special case of this lemma was proved by J. Chover 4 
by a completely different method. He applied his result to obtain a weak version 
of our Theorem 1. Chover's result inspired much of the present paper. 
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1.4 Some Related Results Useful for Large T 

From Lemma 1, it is easy to deduce (see Section 2.4) 
Theorem 3 — Let 7\ ^ 0,T 2 ^ 0,T 3 ^ be such that T x + T 2 = T z . 
Ifr(r) £ 0/orO ^ r ^ T» , then 

P[Tz,t(t)] ^ P[Ti,r(r)}P[T. 2 ,r(r)\. (8) 

This theorem provides some asymptotic information on P[T,r(r)] for 
covariances that are never negative. It implies for these covariances 
that - (l/T) log P[T,r(r)] approaches a nonnegative limit as T becomes 
infinite. In this sense, then, for nonnegative covariances, P[T,r(r)] cannot 
decrease asymptotically more rapidly than exponentially. An exponential 
lower bound for these covariances is found by iterating (8). Thus, if 
T = NT , P[T,r(r)\ = P[NT , r(r)] ^ P\T , r{r)] N . One obtains in 
this manner the exponential bound 

P[T,r{r)] ^ P„P T o ITo T ^ T (9) 

which holds for nonnegative r(r) with P = P[T , r(r)], T > 0. 

For covariances for which P[T,r( r )] is not known, (9) still gives useful 
information by replacing P„ by a lower bound. For example, from the 
lower bounds presented below Theorem 6 in Section 1 .6, it follows that 
for nonnegative t(t) of class 2, P[T,r(r)] ^ f(T) where 

f(T) ~ L "J, (10) 

5 < t < - 
2 "" 2 ' 

By choosing T to maximize f(T ) llT ° and using this maximum value for 
P in (9), one obtains the following 

Lower Bound — // r(r) is of class 2 and nonnegative, then 

P[T,r(r)} ^ 0.121 e~ 2miT "\ T ^ (I.OIG)tt. 

For a specific nonnegative covariance of class 2, a somewhat smaller 
exponent can often be obtained by using for / the lower bound of Theo- 
rem 0, or a lower bound obtained from the comparison theorem and 
example (ii) of Section 1.1. 

For covariances (such as r-i(r) of Section 1.1) that are identically zero 
for r ^ 7'i for some T A > 0, an exponential upper bound can readily be 
written for P[T,r(r)]. For example, if T = (2N - \)1\ , then 
P[(2N — 1 )Ti , )-(t)\ is certainly not greater than the probability that 
the process be nonnegative in the intervals (O,? 1 ,), (27'! , ZTi), • ■ , 
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((2A7 - 2)7 7 i , (2A/ - 1)7\). But the process in anyone of these inter- 
vals is independent of the process in the other intervals because of the 
vanishing of r(r) for r ^ T, . Thus, P[T,r(r)} ^ {P[T t , r(r)]) N . 
Arguing in this manner, one arrives at the 

Upper Bound — If r(r) vanishes for t ^ 7\ , then 

P[r,r(r)} ^^Pr T \ T*Ti, 

where Pi = P[Ti , r(r)]. 

1.5 Bounds from Rice's Series 

Let = k < h < ■ ■ ■ < l n = T be a partition of the interval (0,T) 
into n - 1 parts. Let A { denote the event: "X(t) changes sign at least 
once in the interval U g t < t i+1 ," i = 1,2, ■ • ■ ,n - 1. Then, by the 
method of inclusion and exclusion, 

2P[T,r(r)} = 1 - EPrM.-l + EPrM.riily} 
- 2 PriAiOAiOAk) 

+ ... + (-i) n - 1 p r {^ 1 n^ 2 n ■•• ciAn-j, 

is the probability that none of the events A { occur. If r"(0) exists, the 
above series approaches as a limit as the partition is refined with mesh 
tending to zero 

2P[T,r( T )} = 1 - f qi(ti)dh + | / dh f dkq,{h,k) - ••• s 
(compare Rice, 19 Equation 3.4-11) which we write as 

2P[T,r{r)\ = 1 + t ^^ , 

1 r (U) 

B n = I dh-" I dt n q n (ti, •■• , t n ). 
•>o Jo 

Here q n (h , •■■ ,t„)dh- ■ dt„ is the probability that X(t) has one or more 
zeros in each of the intervals (h , h + dh),- • •,(/„ + dt n ). The existence 
of r"(0) assures us that X(t) has a derivative almost everywhere in 
(0,T) for almost all sample functions. One then has 

q n (h, ■•- l O-/ ii *i-"jL > ^lfe"-fcl (12) 

•[p(?l, ••• ,Sn,Xl, ••• ,X„)]«..-0. 
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Here p(£i , • • -,£,, , x x , • • -,.**•„) is the joint, density for the random vari- 
ables X'(t 1 ),---,X'(tn),X(l i ),---,X(t n ) with fc associated with X'(U) 
and Xi associated with X(U),i = 1,2,- • • ,n. X'(t) is the derivative of 
X(t) with respect to t. 

From the derivation of the method of inclusion and exclusion, suc- 
cessive partial sums of (11) alternately overestimate and underesti- 
mate 2P(T). We therefore have the sequence of bounds 

^ 2P[T,r(r)] ^ 1, 

1 - — * < 2Prrr(r)l < 1 - — + — 2 

1! " - r[1 ' nT,i = i! +2! ' (13) 

1 _ fl + *» _ h < 2P[l\r(r)\ <l-ft + ft-ft + ft 

etc. Unfortunately, except for n = 1,2,3, the integrand q n (ti , • • • ,t n ) 
occurring in the definition of B n cannot be expressed in terms of ele- 
mentary functions. For covariances r(r) of class 2, one has 

fctti) = - , 

w 



where 



and 



nl . . v 1 v u \\/\ — a 2 + a arcsin or] 

7r- (1 — a 2 ) 3 ' 2 



M = (1 - r 2 )(l - r"' 1 ) - r'\2 + 2rr" - r' 2 ), 
a = [(1 - r 2 )r" + rr'*]/[l - r 2 - r'% 

r = r(t, - t x ), r' = r'{t, - h), r" = r"(* 2 - t l ). 



The expression for q 3 is too complicated to warrant display here. 

Bounds given by partial sums such as (13) cannot be expected to 
yield useful results for large T. Typically, for large T, B n behaves like 
T n : the upper bounds exceed unity for large T and the lower bounds 
become negative. 

For small T, however, (13) yields useful information. One has 

T 
Bi = - . 

TV 

If r(r) = 1 — t'/2 + ct /4! + ()(/'), a very tedious computation shows 
that for small T, 
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24 t 

B 3 = OCT 6 ). 

From this and the inequalities (13) follows 
Theorem 4 — // for small r 

r(r) = 1 - £ + fj + 0(r 6 ), 

tfam toe ./*rsf tfwee right-hand derivatives of P[T,r(r)] with respect to T 
exist at T = and are given by 

P[0,r(r)} = h 
P'[0+,r(r)] = -i, 
P"[0+,r(r)] = 0, 

P"T0+^(r)]-|^. 

The assumed form for r(r) in Theorem 4 is important. It has been 
shown by Longuet-Higgins 14 that if r(r) = 1 - r 2 /2 + 6 | t | 3 + 0(r ), 
b * 0, then for small T, B n = 0(T 2 ) for n = 2,3,4,- • •. One can only 
conclude in this case that P'[0+,Kt)] = - 1/2tt. 

The power series 1 + £i" B n \"/n\ can be written formally as 

exp S cX/n. 
i 

Expand the latter in a power series, equate coefficients of like powers 
of X and set X = — 1. There results the formal identity using (11) 

2P[r,r(r)J = 6 -H-.«-*«+- , (14) 



where 



T 
d = Bi - - 

7T 



c 3 = B 3 - 3fliB s + 2Pi 3 
C4 = #4 - 4BiB 3 - 3Ba' + 12Bi 2 B 2 - 6fli 4 , 
etc., with the 5's given by (11) and (12). Relations (15) are the usual 
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ones connecting semi-invariants with central moments (see Ref. 39, p. 
37 or Ref. 40, p. 180). Kuznetsov, Stratonovich and Tikhonov" have 
suggested the use of (14) keeping a finite number of c's as a better ap- 
proximation to P than series (11). For large T, (14) will perhaps yield 
a better approximation than (11), but it is difficult to see under just 
what circumstances this will be true. A knowledge of the asymptotic 
behavior of the c's for large T is needed, but this appears to be a difficult 
point. 

A truncated form of (14) will not in general yield the correct asymp- 
totic behavior of P[T,r(r)]. For example, retaining only ci , (14) gives 
2 P[T,r(r)] ~ e~ Tlr for all class 2 covariances. That this is not in general 
correct can be seen from a family of simple counterexamples. If q(r) is 
of class 2, then so is 

r*(r) = q (ar) S ^, (1(5) 

where a = y/\ - 2 /3 and < ^ \/3- If X(t) has covariance r*{ T ), 
then since r*(wr/j8) = 0,n = ±1,±2,- ■ •, the random variables 

X(T/0),X(2w/P),X@r/0),'.' 

arc independent. Set N = \0T/v]. Then 

P[T,r*( T )} ^ Pr[X(jw/0) ^ 0,j = \,---,N\ = (*)* 
< 2(A)" 77 * = 2e~ {& log 2)r/,r 
Thus if 

V* = l-"32 ^ (3 > pi- = 1.442, (17) 

log 2 

e"P[T,r*(T)] approaches zero exponentially for large T, and the first 
term in the exponent of ( 14 ) yields an incorrect asymptotic behavior. 

It is interesting to note that the form e~ Tlx obtained from (14) by 
retaining only Ci would be correct for a process in which the axis cross- 
ings were independent. One would then have q n {t x , • • ■ ,t„) = Uft(fy), 
J5„ = (Bi)" and c„ = 0,n > 1. For processes with the covariance (10) 
with /3 given by (17), P[T,r*(r)] decays even more rapidly. This has 
nothing to do with the asymptotic behavior of /•*: by proper choice of 
q(r), this can be altered at will. One must suppose this rapid decay of 
P[T,r*( T )] is due to the fact thai typically r*{ T ) takes negative values 
so that at certain time separations the process is anticorrelated. Indeed, 
it is tempting to conjecture that for nonnegative class 2 covariances, 
e "P[7\r{r)] increases without limit for large T. 
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1.6 Some Other Bounds for P[T,r(r)] 

In this section we list a few miscellaneous bounds on P[T,r(r)]. 

Theorem 5 — 

2 f 1 
P[T,r(T)] ^ - / (1 - w) arcsin r(Tu) du. 

The theorem is proved in Section 2.5. If r arcsin r(r) is integrable, the 
bound in Theorem 5 approaches zero like \/T. 

Lower bounds for P[T,r(r)] are difficult to obtain. One is given by 
(see Section 2.6) 

Theorem 6 — If r(r) is of class 2, 

P[T,r(r)] ^ 1 - ¥- + r arcsin r(T). 

This bound goes negative for relatively small values of T (at least be- 
fore T = 2r). It gives somewhat more information than the bound 

P\T,r{r)\ ^ \ [l " l\ , (18) 

obtained from Rice's series (Section 1.5) by retaining only fii . The 
bound obtained by retaining Bi , B 2 and B 3 is of course generally much 
better than that of Theorem 6 but is so complicated that it can be used 
only with difficulty even with a modern computer. For nonnegative 
covariances of class 2, Theorem 6 gives P[T,r(r)\ ^ % - T/4ir. This, 
together with (18), gives (10). 

Theorem 7 — If in the neighborhood of r = 0, 

then 

PlT,r M ] £ \ ~ I - I arcsin [, sin ©] , S T S T. , 

where T] = min (07t,t o ) and t is the smallest positive value of r for which 
r / T ) _ i _ 2\/j8- This theorem follows from the comparison Theorem 
1, the result (») of Section 1.1 and the fact (see Theorem 14, p. 494), 
that for ^ r ^ Ti , the covariance of Theorem 7 is dominated by 
r 8 (/5,T). 
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Theorem 8 — If r(r) is nonnegative and of class 2, then 
PBXr)] * J - £ - £ arcsin [^ sin ^] , Ojr^. 

This theorem follows from the comparison Theorem 1, the result (it) 
of Section 1 and the fact (see Theorem 13 in Section 2.7) that for ^ 
t ^ ir/v2j every nonnegative covariance of class 2 is greater than 
r 2 (l/V2,r). 

We conclude this section with a rather weak, but sometimes useful, 
result proved in Section 2.8. 

Theorem 9 — Let /i(£) be nonnegative for % £ ^ 6 and let h(l-) = 
for £ < and £ > d. Define 



and set 



Then 



G e (x) = f h(x + Z)h(£) rf£ 

J— 00 

•"•(t) = / r(r — £)G»(a;) dz. 



/>[?>( r)] 2: P[T + 0,r(r)]. 

1.7 Relationship Between P[T,r(r)] and F[\,r(T)] 

If r"(0) exists, then almost all sample functions X(t) possess a deriva- 
tive almost everywhere. If r"(0) does not exist, then almost all sample 
functions are nowhere differentiable. In this latter case, if a realization 
X(t) has a zero at t = 0, it almost certainly has infinitely many zeros 
in every right neighborhood of / = 0. In discussing F[X,r(r)], the 
distribution of the interval, /, between successive zeros of X(t), we ac- 
cordingly restrict our attention to covariance s for which r"(0) exists. 

The quantity P[7>(r)] - P[T + A,r(r)] is the measure of those 
sample functions which are nonnegative in (0,T) but are not nonnega- 
tive in ( — A,0), i.e., the measure of those sample functions that are 
nonnegative in (0,7') and have at least one axis crossing in ( — A,0). 
Divide this quantity by the probability vA + o(A) that X(t) have one 
or more upward axis crossings in ( — A,0) and allow A to approach zero. 
There results 

Q[T,r(r)} = -\^ P\T,r(r)} = 1 - F[T,r(r)]. (19) 

Here Q[T } r(r)] is the conditional probability that X(t) be nonnegative 
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in (0,T) given an upcrossing of the axis at t = 0; F[\,r(r)\ = Pr(Z ^ X) 
is the distribution function for the interval I between zeros. One should 
note carefully that the condition in the definition of Q is in the "hori- 
zontal window sense" (see Ref. 10, Section 2 for a more complete dis- 
cussion of this term). We shall find Q[T,r( T )] more convenient to deal 

with than F[T,r(r)]. 

From its definition, Q[T,r(r)] is nonincreasing as a function of T. It 
assumes the value 1 for T = 0. Like P[7>(t)], it satisfies the scaling 
laws 

Q[T,\r(r)) = Q[T,r(r)} 

Q[7>(Xr)] = Q[X7>(r)] (20) 

X > 0. 

For most purposes, then, it suffices to consider only class 2 covariances. 

In this case (see Ref. 19, Equation (3.3-10)) v = — and ( 19 ) becomes 

Q[T,r(r)} = -2T^P[T,r(r)i (21) 

Clearly upper and lower bounds on Q[T,r{r)], say 

Qv[T,r(r)} ^ Q{T,r(r)}, ^ T ^ T 
QdTAr)] ^ Q[T,r(r)}, 0£T£To, 
furnish bounds on P[T,r(r)] by integration: 

\~kf Qu[x ' r(T)] dx - p[r » r(r)i -\~-Ll Wr ' r(r)] dx ' 

However, since Q is nonincreasing, it is also possible to obtain weak 
bounds on Q from known bounds on P. For example, since Q is non- 
increasing, if b > a ^ 0, 

(6 - a)Q[a,r(r)] ^ j QllAr)} <h > (& - a)Q[b,r(r)\, 
or from (21) 

QWM] S 2, ?Mq - ™^ )] i QV,,r(r)). (22) 
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Thus if Pu{T) and Pi.(T) are respectively upper and lower bounds for 
P\T,Ht)\ valid for all T, 



max 27r^^ p^l <k Q[T,r(r)\ 

igr X — 1 



^ min 2w 

o<x<r 



Po(x) - Pl(T) 
T - x 



(23) 



Note that the left inequality of (22) for a = 0, b = T again gives (18). 
Also from (21) and the fact that Q is nonincreasing, it follows that 
P[T,r(r)] for class 2 covariances must be convex downward. 

To the author's knowledge, when the scaling laws (20) are taken into 
account, the only covariance for which Q[T,r(r)] is known explicitly 
is r 2 (/3,r) of (ii), Section 1.1. One has 



r 2 (p,r) = 1 - s + 2 cos flj , 



Q[TMr)\ = 



0, 



1 + 



cos (Is) 



4/1 -^in*(!)/ 



< £ 1, 



S j^ 2r, 



2tt ^ - < 00. 

" 



1.8 4 Comparison Theorem for Q[T,r(r)] 

Imposing the condition that X(l) have an upcrossing at t = in the 
horizontal window sense greatly complicates computation of probabil- 
ities associated with the process. For instance, when X(t) is conditioned 
in this manner, the random variables X(t l ),X(t 2 ),- ■ -,X(t„) are no 
longer jointly Gaussian. If r(r) is of class 2, their joint density is 



2tt / d$ &>(£, Xo , Xi , • • • ,.r„)*fl=o , 



where p(£, .r„ , Xj , ■ ■ ■ ,x„) is the Gaussian density of the unconditioned 
variables X'(0), X(0), X(ti),- -,X(t n ). 

It is possible, nevertheless, to derive a comparison theorem for 
Q[T,i-(t)\ and Q\T,q(r)\ for class 2 covariances somewhat in the spirit of 
Theorem 1. (See Section 2.9 for proof.) The function g(t) = ff-'IKO] 
plays a role here. Writing r = g(t), then q(r) = r(0. For a given value 
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of t, we choose g(t) as the smallest positive value of r for which q(r) = 
r(t). At t = 0, we have t = 0. As t increases from 0, so does t. One of 
two difficulties can occur as t increases: r(t) may reach a local minimum 
r(t ) at t = t„ before q(r) has reached its first local minimum, say g(n); 
t may assume the value n when t assumes the value t y ^ t . In the 
former case we define g(t) only for ^ t < t ; in the latter case, we 
define g(t) only for ^ t ^ h . The comparison theorem can now be 
stated as follows: 

Theorem 10 — Lei r(r) and g(r) 6e of class 2 and Ze< g(t) = q [r(t)] 
be defined as above. If for all nonnegative x and y with x + y ^ T , 

g(x) +g(y) ^ g(x + y), (24) 

then for ^ T ^ T 

Q[T,r{r)} ^ Q\g(T),q(r)l (25) 

It is easy to show that if r( r ) ^ g( r) in some neighborhood of the origin, 
then g(t) has the subadditive property (24) in some sufficiently small 
neighborhood of the origin so that the theorem is not vacuous. 

The steps which led from Theorem 1 to Theorems 2 and 3 are no 
longer valid when X{t) is conditioned to have an upcrossing at t = 0. 
We have found no analogue of these theorems for Q[T,r(r)]. 

By using (21 ), one can integrate the inequality (25) to obtain a more 
complicated comparison theorem for P[T,r(r)], namely 

P[T,r(r)} > \ + f (n *'(*)£ Ptog(r)U€ = P\ff(T) t q(r))/g' (T) 

- I"'" P\l,q{T)]h n (Z)d$, 

valid for g T ^ T . Here h(£) = a -1 (£) = r _1 [a(?)]. 

PART II — PROOFS AND SUPPORTIVE MATERIAL 

2.1 1 The Geometric Approach to P„ 

We wish to consider the probability P„(r) that n jointly normal 
variates, each with mean zero and normalized covariance matrix r, be 
nonnegative. Throughout this section we assume that r is nonsingular. 
Then P„(r) can be written as in (5). Denote the eigenvalues and nor- 

t The material in this section was developed in 1952. Many of the results have 
been obtained independently by other workers and have been reported in the 
literature. Cf. Plackett 41 in particular. 
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malized eigenvectors of r by X, and <r* = (tf-i'^V ■ 'i^O,*' = 1,2, • ■ -,n. 
One has 

S r«fe' - W, 

* (26) 

r<j = 2 tafrVA 

i '• • 

i,i = 1,2, • ■ •,». 
In (5) make the substitution j-, = J^* ^,*\/x*y* • There results 

P.(r) = (2tt)-" /2 jT • • • / dm • • • d2/.e-* W , 

where the region 7? is defined by 

Hi = £ ftVfeft £ 0, t - 1,2,- • ,». 

Denote by A„ the (n — 1) -dimensional content of the intersection of 
this region with the surface of the unit sphere having center at the origin. 
Then, by changing to a spherical coordinate system, 

P n = (2*)-*'*A n frfrr n - , ,- r2/2 = 4- n , 

where S, = 2ir n/2 /r(w/2) is the area of the unit sphere. Thus, P n is the 
fraction of the unit sphere on the positive side of the n hyperplanes 
Hi =_0. The unit normal a' to H t directed into R has components a*' = 
& Va* • From the last of (26), we find for the angle 0,, between a' and 
a', cos da = a -a' = rn . 

As mentioned in Section 1.2, expressions for the content A n of the 
spherical simplex in terms of the angles between its bounding surfaces 
are not known for n > 3. However, for the determination of P[T,r(r)] 
one is concerned with the limit as n — » qo of P n where the angles di/ n) 
are given, for example, by cos 0,/ n) = r[(i — j)T/n) with r(r) a given 
positive definite function. Thus, sufficiently tight bounds for P n might 
in the limit yield useful results concerning P[T,r(r)]. The geometric 
picture suggests a large number of such bounds. Unfortunately, none 
has been found which yields useful limits. Since, however, approxima- 
tions for the n-variable normal integral P n are of interest in their own 
right, we digress to mention several such bounds which may be useful. 
(See Ref. 42 for a bibliography on the multivariate normal integral.) 
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Circular cones with vertices at the origin can be inscribed and cir- 
cumscribed about the region R. The half -angle of the inscribed cone is 
found to be given by 

1 



sin Bi = 



4/X>.7-'' 



and the half-angle of the circumscribed cone is given by 

1 



cos u c = 



a/% r ijVr i r l Vrjr i 



(27) 



(28) 



The fraction of the unit sphere cut out by a circular cone of half-angle 
6 is 



FM = -±- /*' . f <W sin- 2 , = */.,*.. (?4-I , i) (29) 
Vir r ( n - 1 \ J o \ & *j 

where / is Pearson's incomplete beta function. 43 One has 

F n (6i) ^ Pn ^ Fn(Oc). (30) 

Bounds for P„ can also be written in terms of inscribed and circum- 
scribed Euclidean simplexes. The planes //, = intersect n - 1 at a 
time in lines which pass through the origin and a vertex of the spherical 
simplex. Let b i denote the unit vector from the origin to the vertex not 
contained in Hi = 0. One finds for the components b k l = ^. (Xt^i. ) 
and for the content of the Euclidean simplex determined by the origin 
and the end points of the b', 

G 1 (31) 

" n\ VI * 1 VO.ru- 1 

This simplex lies within the region of interest, The hyperplane through 
the end points of the vectors b* sec d c is tangent to the unit sphere. The 
Euclidean simplex determined by the origin and the ends of these vec- 
tors therefore contains the region of interest. Thus, 

^L< P < secngcG » (32) 

V n = " " V, ' 

where V n = r H,i /T(n/2 + 1) is the content of the unit sphere, 6 C is 
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given by (28) and G n by (31 ). Incidentally, for the cosines of the angles 
between the b's one finds the interesting reciprocal relations 

Bum b'b' = ■ Vii \ , ru = 



which is the natural generalization of the usual relationship between the 
sides and angles of a spherical triangle in three-space. 

One can expect the bounds in (30) to be close to each other when the 
b' are nearly coplanar, e.g., when all the entries of r are near unity. One 
can expect the bounds in (32) to be close to each other when the b' are 
nearly codirectional, e.g., when all the entries of r _I are nearly equal. 

An important differential recursion relation first derived by Schlafli 33 
for the content of the spherical simplex can be obtained in an analytic 
manner from the expression (5) for P H . We write 

P„(r) = / <&!••■ f dxrfnixi, ■•• ,*„;!■) (33) 

•'o Jo 

where the n-variatc Gaussian density is given in terms of its character- 
istic function by 

(ufa,.-*?) = r db ••• r diu*'*'*.-**'*** 

J QO ** QQ 



From this latter expression it follows that 

dg„ _ d 2 g n 
dr jk dXjdXk ' 



* > ./• (34) 



Here we regard g n as a function of the n{n — 1 )/2 variables r,-* , k > j, 
and recall that r,, = 1, r jk = r kj . Regarding P n as a function of this same 
set of variables, we find from (33) and (34) 

dPM r j r, a 1 . , 

arii J\) Jq ax\dXo 

Perform the integrations indicated on X\ and x 2 . There results 

d J^l = f dx 3 ■■■ f dx n g n (0,0&, ■ ■ -,x n ;r) ^ 0. (35) 

orii Jo Jo 

Now if g n is the density for the random variables Z x , • • • ,X„ , 

g»(-ri , ■ • V„ ; r) = p(xt , x-i)p(x 3 , • • ■ ,x n | x t , x 2 ), 

where p(x lt ,x->) is the joint density for X t and X> and 

p(x 3 , •• ,x n | Xi ,:r 2 ) 
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is the conditional density of X 3 , • • • ,X n given that X y = X\ and X 2 = 
Xa . In the case of Gaussian variates, these densities are well known 
Evaluating this expression at X\ = x 2 = 0, one finds 



g n (0,0,x 3 , •••,.r„;r) = 



1 



2tt\/1 - 



/ n _ 2 (x 3 , • ••,.r n ;r.i 2 ). 



rtf 



When combined with (35) and generalized for arbitrary indices, this 
yields 

3P„(r) 



1 



dr jk 



2irVl ~ r^ 



P»-i(r.y*) ^ 0. 



(36) 







* uv 


r fi r M 








r iv 1 r jk 1 








r k v r k j 1 I 




1 


r*j r^k 


S 


1 r„j 


r,k\ 


rm 


1 r jk 




Tjy 1 


rjk\ 


Tk* 


n, 


1 




r*» r k j 


1 



Here fjk is the customary notation of the statistician for partial corre- 
lation coefficients (see Ref. 40, Section 23.4 and pp. 318-319), so that, 
for example with n ?*■ j,k, v ?± j,k 



r^.jk = 



Equation (36) is Schlafli's celebrated differential recursion formula. 
His many relations connecting the angles of the boundary simplexes are 
familiar to the statistician as identities among partial correlation co- 
efficients. 

We close this section with a simple demonstration that for odd n, P„ 
can be expressed in terms of the content of lower dimensional simplexes. 
Let pi denote the probability that X, be nonnegative, p„ denote the 
probability that X { and Xj be nonnegative, etc. Then P„ = p a ... n ■ Set 
Mi = Sp, , Mz = 2><> Va » etc - Then from the well-known inclusion 
and exclusion formula, the probability Q„ that none of the variates be 
nonnegative is 

Q n = 1 - Mx + Mi- ••• + (-1)"M„. 

But from symmetry, P n = Qn = M n so that 

[I _ (-\)"}P„ = 1 - Mx + Mt- ••■ + (-l)"" l M n _i. 

(Cf. Sommerville,* 5 Chapter IX, Section 1.9.) No recursion is known for 
even n. 
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2.2 Proof of Lemma 1 

Lemma 1 follows directly from (35). Note that in the derivation of 
this result, it was not necessary to normalize the covariance matrix. 
This result thus states that if p is a position definite symmetric matrix, 

then 

^ 6 0, j> i, (37) 

with P„(p) defined by (5). 

Now let r and q be nonnegative definite n X n symmetric matrices with 
ru = Qu = 1. Then p = Ar + (1 - A)q + el, where I is the n X n unit 
matrix, is positive definite for each e > and each A satisfying ^ 
A g 1. Consider P H (g) as a function of A. It is readily established that 
P„(p) possesses a continuous derivative and indeed that 

dPn(g) m £ qPn(g) dpu 
d\ j>i dpu d\ 

v dP n ( 9 ) , 
= 2- , (rij - qij). 

i > i op a 



If now m ^ (ja ,j > i, (37) then gives 

dPn(o) 



d\ 



^ 0. 



Integration on A from to 1 yields P„(r + el) ^ P„ (q + e I). From 
well-known continuity theorems (see Cramer, 40 Section 24.3 and 10.7), 
Lemma 1 follows by letting e tend to zero. 

2.3 Proof of Theorem 2 

Let r(r) and q(r) both be of class a > and suppose that r(r) ^ 
q(r) for ^ t £ To . Then for any A > 1, 

r(r) ^ (j(t) ^ r(Ar) 

(38) 
< r ^ n (A), 

for some suitable t,(A). By Theorem 1, then, and the scaling law (2) 

P[T,r{r)] ^ P[T,q(r)] ^ P[XT,r(r)} 

(39) 
Ogri n(A). 

To see how best to choose A to obtain a good lower bound for P[T,q{ r) J, 
it is convenient to define a version of h(r) = r~ l [q( T )). Let r q be the 
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smallest value of t > for which q(r) is not decreasing. (Strictly speak- 
j ngj Tq = inf. of those T for which q(r) is not strictly monotone for 

< t ^ T. If this T set is empty, define r, = » .) Define r r in an anal- 
ogous manner. The function r~\q) is defined for 1 ^ g ^ r(7>) by the 
branch having values between and r T . Similarly we define q \r) for 

1 ^ r ^ q(rq) by the branch having values between and r q . If q{r q ) ^ 
r(r r ), we define h{r) = r _1 [g(r)] only for ^ r < g _1 [r(iv)]. If q{r q ) ^ 
r(r r ), we define h(r) for ^ r < r 3 . Clearly fc(0) = 0. As r increases 
from zero, h(r) is at first at least as large as r, since r(r) ^ q(r) near 
r = 0. For small r, r(/i) = g(r), so that h'(T)r'(h) = q'(r) or 

*'«,+ ) = Km <^ = lim £ = Urn (^Y"" = h'(0^)^ t 

so that A'(0+) = 1. Three typical curves for y = h(r) are shown in 
Fig. 1. Note that h(r) is strictly monotone in its domain of definition. 

Consider now the plots of y = h( T ) and y = Xr as shown on Fig. 1. 
For all values of X, these curves have the origin as a point in common. 
When X = 1, the straight line y = Xr is tangent to y = h(r) at the 
origin. As X is increased from 1, a second point of intersection moves 
out from the origin. It may happen, as in Fig. 1(a), that the line y = 
Xr becomes tangent to y = h{r). If so, we denote by T* the abscissa 
of the first such point of tangency as X increases from unity and we de- 
note the corresponding value of X by X*. If no such tangency occurs, we 
denote by T* the largest value of r in the domain h(r). In this case we 
set x * = h(T*)/T*. (Note that X* may be infinite.) Observe that for a 
given X < X*, the abscissa of the first point of intersection of y = Xr 
with y = h(r) to the right of the origin, say n , satisfies h(n) = Xri 
or q(n) = r(Xn). For r ^ n , the right inequality of (38) maintains; 
f or T = Tl + e,r(Xr) > f/(r) for small positive e. 

The lower bound P[\T,r(r)] on the right of (38) is a nonincreasing 
function of X for a fixed T. For a given T ^ T*, then, this bound is made 
as large as possible by choosing X as the smallest value greater than unity 
for which q(T) = r(\T). With this choice, XT has the value h(T) and 
Theorem 2 is proved. The largest T* for which the theorem as stated in 
Section 1.3 is true is the value T* defined in the previous paragraph. 

Note that if r(r) and q(r) cross at r > 0, i.e., r(r ) = q(r ), T* is 
necessarily less than r„ , for in this case, y = h(r) crosses y - r at r„ 
as in Fig. 1(a) and a tangency occurs as indicated. 

2.4 Proof of Theorem 8 

Let Ti > and 7' 2 > be given and set T 3 = T, + 7' 2 . Consider 
the approximation to P[T 3 , r(r)] given by the probability P„(r) that 
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all be nonncgative. Here = /, < t 2 < ■ ■ ■ < /„, = T x is a partition 
of (0,7\) and T x < n < r% < ■ ■ ■ < r,,,, = 1\ is a partition of (7', , 
T\ + 7'-..) and n\ + n-> = n. The covariance matrix r can be written in 
block form 



r = 



A B 

B C/' 



where A is an n x X n, normalized covariance matrix with elements 
r(U — tj) C is an n 2 X n 2 normalized covariance matrix with elements 



yf 



,y = ** r 


/U = r 


/ y = \r 


^-y=h(r) 


Jar\ 

r i i 


(a) 



71 T 1 



r 





Fig. 1 — The curve y = h(r). 
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r ( T . _ T .) t an d b has «i rows and n 2 columns and elements r(U - tj). 
Now 



\0 c)' 



is also a covariance matrix, and if r(r) ^ for ^ r £ Tt , the ele- 
ments of r are not less than the corresponding elements of f. From Lemma 
1, it follows that P„(r) ^ P n (f). But f is the covariance matrix for two 
independent sets of random variables so that 

Pn(r) ^ P»(*) = P. 1 (A)P n ,(C). 

By refining the partition with mesh tending to zero, one has P[T 3 ,r(r)] 
^ P[Ti ,r(T)]P[T 2 ,r{T)] and the theorem is established. (It is trivi- 
ally true if Ti or T 2 or both are zero.) 

2.5 Proof of Theorem 5 

Theorem 5 is a consequence of the following more general 

Theorem 11— Let the random variables Xi,X 2 , •• -,X n , n > 2 have 

a joint density p(xi , • • ■ ,x n ) with the property p(—Xi , • • -,-.r„) = 

p(xi, ■■-,x n ). Then 

PrlX, * 0,i - 1,2, • ■ ,») £ -1 + j^ E Pr|X, S OX, £ 0). 

The proof of this theorem follows that of a theorem by Gaddum 44 
concerning spherical simplexes and their angle sums. We introduce the 
following notations: P {j = Pr(X, £ 0, X, ^ 0), P = Pr{Z, ^ 0, i = 
1,2,- ■ -,n\, ^("i > «2 . ' • -» a ») " Pr (ai^i ^ 0, a 2 X 2 ^ 0, • • ■ ,a n X n ^ 0}, 
a,- = ±1,» = l,--,n. ThusP = 72(1,1,- • -,1) and 

X #( fl l » fl 2 I ' ' •> a «) = ll 



■li 



where in the sum each a takes values +1 and -1. The 2" symbols R 
are equal in pairs; 

#(01,02, •••,<*„) = .ft(-ai , -fls, •••,-o»). 
We call J?(-ai , -a 2 , • • -,-a») the complement of fi(a, , a„ , • • -,a„). 
One has 

p 12 = P + S , ie(l,l l Oa , a 4 , • ■ -,o«) 

p M = p + Z'22(l,Oj , 1,04 , ••• > o B ) (40) 

P„ ( „_d = P + 2'i2(o, , a 2 , • • -,a„_ 2 ,1,1). 
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Here the R symbol on the right of the equation having P,-y as left mem- 
ber has a 1 in the i th and j th places and a's elsewhere. In each equation, 
the sum is over all combinations of plus and minus 1 for the a's except 
for the combination all a's plus 1. 

Now consider adding the n(n — l)/2 equations (40). One has 

£P„ = [n(n - 1)/2]P+ S, 

where S is the sum of all the sums of R symbols on the right of (40). A 
given R symbol with precisely,/ of its arguments +1 will occur j(j — 1 )/2 
times in S, j = 2,3,- • ■ ,n — 1. Denote by T$ the sum of all R symbols 
that have precisely j of their arguments +1. Then 

v n{n - \) (n - l)(n - 2) 

2^ ry = s ? ~" 2 " _1 

,<J - 2 (41) 



J-2 



Now 



so that 



y i(j ~ 1) T ._ y? (n-Mn-J- Q r 

3=2 ^ 3=2 4 



gj^ r ,^g[^r, + <»-'•>(»-> -» r ._,]. 

But since an R symbol and its complement arc numerically equal, Tj = 
T n -i , so that (41) becomes 

Z p« = »<" ~ " p + (ra - 1) <» - ° r_ 

Now, for j = 2,3,- • • ,n — 2, 

j(j ~ 1) , (n -j)(n -j - I) > n(n - 2) 
2 2 4 ' 

so that 

£ p.. j. »(»-!) p + (»-!)(» -2) ^ 
»</ 2 2 

+ ^V^ 2 r, a 2<»f» p + fc^ £ *V- 

O ;=2 -S O , = l 



488 THE BELL SYSTEM TECHNICAL JOURNAL, MARCH 1962 

However, the last appearing sum is 1 — 2P and Theorem 11 follows 
directly. 

In the case of a Gaussian process X(t) with normalized covariance 
function r(r), we consider the application of Theorem 1 1 to the random 
variables X { = X(iT/n),i = 1,2,- --.n. Then from (G), P„- - I + 
1/27T arcsin r[(i — j)T/n\. By taking limits as n becomes infinite, The- 
orem 1 1 then yields 

P[7>(t)] ^ -^ / <lil / d.v arcsin r(y - x). 

Elementary manipulations then lead to the result stated as Theorem 5. 

2.6 Proof of Theorem 6 

Consider n random variables, X, , X 2 , ■ ■ ■ ,X n , and the following 
mutually exclusive events: (A) the variables are all nonnegative; (B,) 
the first j variables are nonnegative and the (j + l) 8 ' is negative, j = 
1,2,3,- ■ ■ ,n — 1. The union C of these events is the event X\ ^ 0. We 
suppose Pr{C} = § and write P„ = Pr^},^ = Vr\Bj\,j = 1,2,- • -,n - 1 
so that 



But Vj ^ Pr{X x ^ 0, Xj ^ 0, X j+1 < 0\,j = 2,- ■ -,n - 1 so that 

re— 1 

P„ ^ £ - Pr{A r ! ^ 0,X 2 ^ 0! - E Pr(^i ^ 0,X, ^ 0, AVk < 0}. (42) 



Consider a stationary Gaussian process X(l) with a class 2 covariance 
/•(t). In (42) set Xj = X(jT/n). From (7), one obtains 

Pr{Xi ^ 0,X; ^ 0,X j+l < 0} 

= I + & [ arcsin r [ u - i} 3 " arcsin r D 3 - arcsin r [3] ' 

and from (G) 

1 1 . /T\ 

PrfXi ^ 0,X 2 ^ 0} = - - — arcsin r ( -J . 

Insert these values in (42) and pass to the limit as n becomes infinite. 
Theorem 6 results. 
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2.7 On Class 2 C ordinances 
Let r(r) be a class 2 covariance. From the Bochner representation 

/•(r) = / vos \rclF(\), 
•'o 

where we now have 

1 = f dF(\) = f \ 2 dF(\), 

•Ml JO 

it is not hard to show that r is continuous, that r'(r) exists everywhere 
and is continuous, and that r"(r) exists and is continuous everywhere 
except perhaps at r = 0. 

If the process X(t) with mean zero has r{r) as its covariance func- 
tion, then the four random variables X(0),X'(0),X(t),X'(t) have 
covariance matrix 



1 





r 


r' 





I 


->•' 


-r' 


r 


-/•' 


1 





r' 


-/•" 





1 



whore wo write r = r(t), r' = d/dt ?•(/), r" = d 2 /dt 2 r(t). For this to be 
a nonnegative definite matrix it is necessary that the determinant of 
all major diagonal submat rices bo nonnegative. Evaluating these deter- 
minants, one finds the system of differential inequalities 

(1 - r - r /2 )(l - r' 2 - r"-) - {>;■' + r'r") 1 ^ 0, (43) 

1 - r % - r rl ^ 0, (44) 

1 - r - r"' 2 £ 0, 1 - r' 2 - r"- ^ 0, 

1 - r ^ 0, 1 - r rl ^ 0, 1 - /•"'-' ^ 0. 

These inequalities can also ho derived without raising the question of 
existence of the derivative process by demanding that the covariance 
matrix of the four random variables X(0), X(e) - X(0), X(t),X(t+e) 
— X(t) be nonnegative definite for arbitrarily small values of e. 
Consider now the family of covariances 

r 2 (/3,r) = 1 - ? + /3 2 cos (j\, g |8 g 1, (45) 

introduced in Section 1.1. In what follows, we shall be concerned with the 
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family, F, of curves r = r 2 (/9,T), where for each with < ft ^ 1 we 
restrict our attention to the interval g r ^ tt/3. Several members of 
the family are shown in Fig. 2. The following statements, evident from 
the figure, are easy to prove analytically. ( 1 ) The curves of the family 
do not intersect each other except at r = 0. (2) A horizontal line r = 
r„ with | r 1 < 1 intersects exactly o nce each mem ber of F with param- 
eter value in t he rang e 1 ^ ^ V(l - r )/2. For each value of a 
satisfying - Vl - r 2 ^ a ^ 0, there is a unique member of the famly 
that intersects the line /• = r with slope a. If 0{a) denotes the param- 
eter value of this member of F, 0(a) is a continuous strictly monotone 
decreasing function of a, — y/\ — r,r ^ « ^ 0. 

We shall say that the curve r = r(r) intersects the curve r = (i(t) 
from below if at the point of intersection r' > g'. 

Lemma 2 — Let r(r) be of class 2. 

a. // the first local minimum of r(r) is at n , then r = r(r) cannot 
intersect from below any member of the family F, 

r = r 2 (/3,r) = 1 - /3 2 + (3 2 cos (£\ , g r ^ tt/3, fg & ^ 1, 



in the interval ^ t ^ ti . 

b. // r = r(r) passes down through the point (T„,r ) with slope rj 
satisfying — \/l — r a 2 ^ r ' ^ 0, £/iew //iere is a unique translated member 
of F, say r = r 2 (/3„ , t - m) wMcft passes through (r , r c ) witfi sZope r„'. 
If r 2 {0 o ,T — n) and r(r) are nonincreasing for f ^ r ^ t , Men t(t) 
^ r 2 (/3„ ,T-/i)/orf|T^r„. 




180 



Fig. 2 — The family /-'. 
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Proof — Part a of the lemma will be deduced from part b. The first 
conclusion of part b is the remark (2) above. The second conclusion of 
part b follows from the inequality (43). If | r | ^ 1, this latter can be 
written by elementary algebraic manipulations as 

1 ~ r 2 ~ r* < r „ jr^_ < 1 - / - r " 
1 - r 2 - T 1 - r 2 ~ 1 - r 2 ' 

The right-hand inequality can be rewritten as 



(1 - r) 2 (1 - rY ~ (1 - r) 2 ' 
or, if r' ^ 0, as 



2r'r" 2r' 3 ^ 2/ 



+ TT^ ?= ^ 



(1 - r)- (1 - r) 3 ~ (1 - r) 2 ' 
or 

±2± 1 



rfr (1 - r) 2 ~ drl-r' 
Integrate this expression from t to r with r < r to obtain 



r' 2 2 r/ 2 2 



(l- r )2 1 — r - (1 — r.) 1 1 ^7. ' (40) 

where the subscript o refers to quantities evaluated at r„ . Denote the 
right member of this inequality by - l/h 2 , and note that, as is indicated 
by the notation, 

1_ = 2(1 - ;•„) - r." (1 + r.)(l - r ) - r. n _ 1 - r„ 2 - r,/ 2 .. ft 
V (1 ~ r o y (1 - r„) 2 (1 - r o y- ~ °' 

by (44). Inequality (4(i) now becomes 

r' 2 - 2(1 - r) <, - 1 - (1 - r)\ 

or what is the same 



where 



- ^ ;72 (1 - r)(r - X), 

X = 1 - 2/j 2 . (47) 



It follows then that 



V(l - r)(r - X) = A 
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with h a nonnegative quantity. Integrate this again from r to t„ to ob- 
tain 

r - (1 + X)/2 . r - (1 + X)/2 r - r 

arCSin (1 - X)/2 - arCSm (1 - X)/2 = — 7^ • 

Thus one finds 

1 + X 1 - X . r r „ - r , . r, - (1 + X)/2 l 

r(r) ^ -f- + — sin [_-£- + arcsin (1 _ x)/2 j ( 48 ) 

- q(r). 

This inequality is valid in a r-range to the left of r until either q(r) or 
r(r) has a local maximum. 

Now by (47), q{r) can be written 

q( T ) = 1 — h 2 + h~ cos ( — t — ) , 

for suitably defined m, and one finds by using the various definitions 

q{T ) = r u 

q'(r ) = r.'. 

Thus q(r) is the member of the family F which, when translated in the 
r-direction, passes through the point (r , r„) with slope r '. To the left 
of T , the curve r = r(r) remains below this translated member of Z^. 
Part b is thus proved. 

Now suppose that r = r(r) intersects a member of the family F from 
below, say at (t„ , r„) with r„ ^ n . Let the parameter value of this 
member of F be ft, . Since ^ r'( To ) > r 2 '(0 o , t„), the translated mem- 
ber of F passing through (t„ , r„) with slope r'(T ) has a parameter 
value = 0i < 0o . This translated version of r = r 2 (0i , r) has no local 
maximum in the interval (0,r o ), and its value at r = is less than unity. 
One thus has the contradiction r(0) < 1 and the lemma is proved. 

Theorem 12 — Let r(r) be a class 2 covariance. Then 

r(r) ^ cos t, ^ t ^ ir. 
Proof: In a region where r'(r) ^ 0, inequality (44) implies 

-is- vr=^ = L 
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Integrating from r„ to t > r a assuming that r'{r) ^ throughout 
(t„ , t), one finds 

— (t — To) + arccos r„ ^ arccos r ^ (r — r„) + arccos r , 

where r = r(r ). This in turn implies cos[t — r — arccos r„] ^ r(r) 
and /-(r) ^ cos[t — t„ + arccos /■„], where the former inequality holds 
from r = t„ until the cosine assumes the value unity, and the latter 
inequality holds from r = r„ until the cosine assumes the value minus 
unity. The result may he stated as follows: Let the class 2 covariancc 
r(r) pass downward (= not upward) through the point (r„ , r„) in the 
T-r plane. The curve r = cos r can be translated in the r-direction 
to pass downward through (r„ , /•„). Then to the right of t , r = r(r) 
lies above this translated cosine curve until either the cosine curve or 
r(r) has its next local minimum. Similarly, a cosine curve can be trans- 
lated to pass up through (r„ , /•„). To the right of r„ , /• = r(r) lies below 
this translated cosine curve until either r(r) has its next local minimum 
or the cosine curve has its next maximum. 

A similar result holds if r(r) increases through (r„ , /■„). 

Now let T„ — 0, r — 1. Then /• = r(r) lies above r = cos r until the 
first minimum of either. If the first minimum of r(r) occurs at t\ ^ ir, 
the theorem is proved. Suppose now t\ < ir and that /• = r(r) crosses 
r = cos t in (0,ir). The first such crossing must be downward, since 
r(r) ^ cos t from to n. If the crossing is at f, then r(?) = cos f, 
and / -/ (f) ^ — sin f. If indeed r'(f) < — sin f, one obtains from (43) 
the contradiction 1 ^ r (f) + r' (f ) > cos z f + sin J f = 1. On the 
other hand, if the crossing takes place with r'(f) = — sin f, then b of 
Lemma 2 shows that r(r) ^ cos r for r < f which contradicts the 
assumption that the crossing was downward. Thus, the theorem is 
proved. 

Theorem 13 — // r(r) is of class 2 and 



TV 



then 



r(r) ^0, Ogrl-^ 



r(r) ^ r 8 (77= , t j = § + § cos V'2 t = cos 2 M|J 



/or 



494 THE BELL SYSTEM TECHNICAL JOURNAL, MARCH 1962 

The theorem is a consequence of repeated applications of Lemma 2. 
We prove the theorem by supposing it false and then arrive at a con- 
tradiction. We refer to the curve r = r 2 (l/\/2,r),_0 ^ r ^ ir/\/2 as C. 

Suppose now that r(r) ^ for ^ t ^ ir/y/2 and that some point 
P on r = r(r), say (t , ?•<,), lies below C. Denote r'(r ) by r ' . We can 
suppose P chosen so that r ' < 0, since r = r(r) cannot be nondecreas- 
ing at all points where it lies below C. Let the horizontal line r = r 
through P intersect C at P x and denote the slope of C at Pi by C'(r ). 
The point Pi has larger abscissa than the point P„ . The curve r = r(r) 
possesses a continuous derivative. As the height r of the horizontal line 
r = r Q is continuously decreased to zero from its initial value, a value 
must be found with P to the left of P x and r,' ^ C'(r ). By b of Lemma 
2, a curve of the family F with parameter value ^ 1/V2 can be trans- 
lated to the left to pass through P with slope r„'. In the interval ^ 
r ^ t , this translated member of F lies strictly below C and is mono- 
tone. The first local maximum of r = r(r) to the left of P therefore 
lies below C as must also the local minimum just preceding this maxi- 
mum. A curve of F can then be translated to pass through this local 
minimum with slope zero, and repetition of the argument shows that 
all local maxima of r = r(r) for ^ r ^ r lie below C. In particular 
r(0) < 1, which contradicts the initial assumption concerning r(r). 
Q.E.D. 



Theorem 1 4 — Let the covariance r(r) have the behavior 

4 

T 

4! 



r(r) = l-'j + m^ + oiT'), 



near t = 0. Then 

with r 2 (p,T) given by (45). Here 1\ = min(/37r,r ) and t„ is the smallest 
positive value of t for which r(r) = 1 — 2/ra. 

Proof — The first four derivatives of r(r) exist at t = 0. From the 
Bochner representation for r(r), it is easy to show using Schwarz's 
inequality that 

v 2 = m - 1 ^ 0. (49) 

It also follows that r"(r) exists everywhere and is continuous. 

The Gaussian process X(t) having covariance r(r) has first and second 
derivates X'(t) and X"{t) almost everywhere with probability 1. The 



ONE-SIDED BARRIER PROBLEM 



495 



covariance matrix of the random variables X(0),X(t),X'{t),X"(t) is 



r r r" 

1 -1 
1 



-1 



in 



The determinant of this matrix cannot be negative. This is equivalent 
to the inequalities 



-v < 



r + r" 



Vl - r 2 - 



^ v. 



In any region where r' ^ 0, the right-hand inequality gives 

r'(r + r") d A - , 

Vl-r>- r' 2 = ~dr Vl ~ r - ~ r * = Vr ■ 

Integrate this from to r to obtain 

VI - r- - r' 2 £v(l - /•)• (50) 

Note that if n is the first positive value of r for which r'{r) = 0, (50) 
gives 



r( Tl ) ^ 



v ' - 1 
v- + 1' 



Thus we have the interesting side result that if r(r) is everywhere non- 
negative v ^ 1 or m ^ 2. 

Squaring the inequality (50) and rearranging the terms, one finds 



where 



r r - ^ (1 + if)(l -r)(r - a), 



\f - ] i 
a = < 1 . 

v- + 1 ^ 



(51) 



Since r' ^ 0, this implies 

r' . 

V (l-r)(r- a) = _Vl+u2 ' 

if r > a. Integration from to t yields 



r - (1 - a)/2 

arcsin — j- r - 7 — — 

(1 - a)/2 



-S^ -Vl +f 2 r, 
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where it is assumed that r ^ r . If then 

r - (1 + a)/2 . . (v /T—TTs ^ 

or, what is the same thing in virtue of the definitions (49) and (51), 

r(r) ^ 1 3 + "a cos ( mT )- 

m 2 m 2 

The theorem is thus proved. 

2.8 Proof of Theorem 9 

Let h($) be nonnegative f or ^ £ ^ and zero elsewhere. Then 

y(f) = f h(t - t')X(t') dt' = f duh(u)X(t - u) du, 

■It-e J -°° 

will certainly be nonnegative for ^ t ^ T whenever X(t) is nonnega- 
tive for - 6 ^ t ^ T. The probability that the Y process be nonnega- 
tive in (0,T) is therefore not less than the probability that the X process 
be nonnegative in (-6,T). If X is Gaussian with mean zero and covari- 
ance r(r), then Y is Gaussian with mean zero and covariance 

r B ( T ) = EY(t)Y(t + r) « [ du [ du h(u)h(v)EX(t - u)X(t +t - v) 
= I du I dv h(u)h(v)r(T - u + v) 

J— 00 •*— 00 

= r dxrir - x) f #*(* +*)*(*)• 

J-oo •/— 00 

One has then P[7>(t)] ^ p \- T + ^{r)\ t which is Theorem 9. 

2.9 Proof of Theorem 10 

LetO = U < k < ■■■ <L= T be a partition of (0,!T). Define Q„(r) 

by 

n , x _ Pr(X(fc) < 0,X(U) ^ 0,i = 2,3,--,») (52) 

y * U; Pr(X(«i) <0,X«,) ^ 0) 

where X(0 is a Gaussian process with zero mean and class 2 covariance 
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r(r). As the partition is refilled with mesh tending to zero, Q n (r) ap- 
proaches Q[T,r(r)] as a limit. The numerator on the right of (52) is 
P h (t) where 



r = 



1 -r(fr) -r(t 3 , 

-r{h) 1 r(h - k) 

-r(t 3 ) r(h - U) 1 

-r(t„) r(t n - tt) r(l„ - t z ) 



-r(tn) 
r(t„ - k) 
r(t„ - k) 

I 



, (53) 



and as usual P„(r) denotes the probability that n normal variates of 
mean zero and covariance matrix r l.c nonnegative. Note that the de- 
nominator of the right of (52) depends only on r(t 2 ). 

Let another Gaussian process, Y(t), have class 2 covariance q(r). 
Wedenne?" l (r),ry" 1 (r),/i(r) = r~ l [q(r)] as in Section 2.3 and set g(t) = 
Q^lrtD] - h'^t). Note that g(t) is strictly monotone within its domain 
of definition. Assume that T is within the domain of definition of g. 
With the points U given as in (52), set r, = g{U),i = l,2,--,«. The 
points = r, < t 2 < ■ ■ ■ < t„ = g(T) form a partition of the interval 
(0,0(7')). The mesh of this partition tends to zero with the mesh of the 
/, partition. 

Consider now the approximation to Q\g(T),q(r)] given by 

, Q ) - PHHti) <0,Y(t.) ^0,i= 1,2,3, ■ ■ ■ , n\ .... 
M) ~ Pr|F(ri) <0,Y( T ,) Z 0} ' (54) 

The numerator here is P„(q) where q is given by (53) with r replaced 
bygand * replaced by r. Since t,- = g(k),q(rt) = r{U),i = 1,2,- • -,n, so 
that the first row and column of f are the same as the first row and 
column of q. For any other entry of f with ti ^ tj , one has 

r(U ~ tj) = q[g{U - tj)] 

= q[n - tj + [g(U - tj) - g(U) + g(t } )\}. 

Since q{r) is nonincreasing 

r(U - t } ) S q(n - 77) 

and hence by Lemma 1 

P n (t) ^ PM), 
provided 

9(t, - ti) - g(t,) + g(t { ) i= 0. 
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or what is the same thing, provided 

g(x) +g{y) i= g(x + y), (55) 

where ^ x = tj < U - x + y. 

When (55) is satisfied, the numerator of (54) is not less than the 
numerator of (52). The denominators of these expressions are equal 
since they are the same function of r(t 2 ) = ?(t 2 ). Therefore, Q„(q) ^ 
Q„(r). The conclusion of Theorem 10 results by passing to the limit as 
the / partition is refined. 

2.10 Generalizations 

A number of the results presented in this paper can be generalized in 
a direct manner. We only mention here an obvious extension of Theorem 1 . 

In the derivation of Lemma 1, the lower limit of integration for x* in 
(33) can be replaced by a, . Now choose a, = a(U) with a(l) a given 
function defined for ^ t ! £ T, and where the points U form a partition 
of (0,7'). Proceeding as in the derivation of Theorem 1, one arrives at 
the following more general result. Let X(t) be a Gaussian process with 
EX{t) = 0,EX(t)X(s) = r(s,t). Let Y{t) be a Gaussian process with 
EY(t) = 0, EY(t)Y(s) = q(s,t). Then if 

r(s,s) = q(s,s), ^ s ^ T 

and 

r(s,t) ^ q(s,l), ^ s,t ^ T 
Pr{X(t) 2l a(t),0 ^ t ^T\ ^ T>v{Y(t) ^ a(t),0 ^ t ^ T\. 

2.11 Asymplotics 

As already remarked in the introduction of this paper, there appears 
to be little in the literature concerning the asymptotic behavior of 
P[T,r(r)] for large T. Intuition would indicate exponential falloff for a 
wide class of co variances. Example (it) of Section 1.1, though special 
in nature since r 2 (/3,r) is periodic, provides a counterexample to expo- 
nential behavior, and so the class must be carefully defined. Here, by 
the two bounds presented in Section 1.4, we have shown exponential 
behavior for nonnegative covariances that vanish identically for t 
greater than some r > 0. Recently, by using Theorem 1, M. Rosenblatt 
has established an asymptotic exponential upper bound for P[T,r(r)] for 
all covariances which are ultimately majorized by a decaying exponen- 
tial. This, together with the lower bound of Section 1.4, establishes the 
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asymptotic exponential behavior of P[T,r(r)] for all nonnegative co- 
variances that themselves decay exponentially. Professor Rosenblatt has 
also established that if r(r) — » with increasing r, then T"P[T,i-(t)] — » 
with increasing T for every n > 0. 

We conclude with the remark that from (23) of Section 1.7, one can 
show that asymptotic exponential behavior of P[T,r(r)] implies asymp- 
totic exponential behavior for Q[T,r(r)]. 
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